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LINEAR EQUATIONS WITH RATIONAL FRACTIONS OF 
BOUNDED HEIGHT AND STOCHASTIC MATRICES 


IGOR E. SHPARLINSKI 


Abstract. We obtain a tight, up to a logarithmic factor, upper 
bound on the number of solutions to the equation 


n 



with variables r ±,..., r n in an arbitrary box at the origin and 
variables Si,..., s n in an essentially arbitrary translation of this 
box. We apply this result to get an upper bound on the number 
of stochastic matrices with rational entries of bounded height. 


1. Introduction 


1.1. Motivation. We recall that a stochastic matrix is a square n x n 
matrix A = (o:jj)”- =1 with nonnegative entries and such that 


n 



(i) 


i — 1,..., n. 


Furthermore, as usual, for a rational number a we define its height 
h(cii) as max{|s|,r} where the integers r, s G Z are uniquely defined 
by the conditions a = s/r, gcd(r, s) — 1, r > 1. 

We now define S n (H ) as the number of stochastic n x n matrices 
with rational entries of height at most //, that is, with entries from 
the set 


T(H) = {a e Q : a > 0, h(a) < H}. 


In particular, J-(H) D [0,1] is the classical set of Farey fractions of 
order H. 
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The question of estimating of S n (H ) seems to be quite natural, how¬ 
ever it has never been addressed in the literature. Since the condi¬ 
tions (1) are independent, we clearly have 

(2) S n (H) = L n (H) n , 

where L n (H) is the number of solutions to the linear equation 

n 

^2 a i = 1, a j eJ r {H),j = l,...,n, 

3 =1 

which we also write as 

n 

( 3 ) ^ — = 1 H >rj > Sj > 0, gcdfo, Sj) = 1, j = 1,..., n. 

j=l Tj 

We remark that each solution to (3) leads to an integer solution to 

n 

( 4 ) = °’ \ m j\ - 

3=1 Kj 

with K = 2 H. Indeed, it is enough to set ( kj,rrij ) = ( Vj,Sj ), j = 
1,..., n — 1, and (k n , m n ) = (r n , s n — r n ). Furthermore, distinct solu¬ 
tions to (3) yields distinct solutions to (4). 

We now recall that for n = 3, the result of Blomer, Briidern and 
Salberger [2] gives an asymptotic formula K 3 Q(K) + 0(K 3 ~ 5 ) for the 
number of solutions to (4), where Q G Q[W] is a polynomial of degree 
4 and S > 0 is some absolute constant. This immediately implies the 
bound 

L 3 (H) = 0(H 3 (logH) 4 ). 

It is also noted in [2] that the same method is likely to work for any n, 
it may also work for the equation (3) directly and give an asymptotic 
formula for L n (H). The more elementary approach of Blomer and 
Briidern [1] can probably be used to get an upper bound on L n (H) 
of the right order of magnitude. However working out the above ap¬ 
proaches from [1, 2] in full detail may require significant efforts. Here we 
suggest an alternative way to estimate L n (H) via modular reduction 
modulo an appropriate prime and bounds on some double exponen¬ 
tial sums with rational fractions. Although the bound obtained via 
this approach does not reach the same strength as the hypothetical 
results that can be derived via the methods of Blomer, Briidern and 
Salberger [2] or Blomer and Briidern [1], it is weaker only by a power 
of a logarithm. 


LINEAR EQUATIONS WITH RATIONAL FRACTIONS 


3 


Furthermore, the suggested here method seems to be more robust 
and also applies to more general equations, see (7) below. Besides it 
also works for variables in distinct intervals and not necessary at the 
origin. However it is unable to produce asymptotic formulas. 

Clearly, to estimate S n (H ) one can use general bounds for the num¬ 
ber of integral points on hypersurfaces, see, for example, [7] and refer¬ 
ences therein, however they do no reach the strength our results. 

1.2. Main results. We derive the bounds L n (H) and thus on S n (//) 
from a bound on the number of solutions of a much more general linear 
equation. 

Namely, we consider two boxes of the form 

(5) 03 o = [1, Hi] x ... x [1, H n ] 
and 

(6) 03 = [B\ + 1, B\ + Hi] x ... x \B n + 1, B n + H n ], 

with arbitrary integers Aj, Bj and positive integers Hj, i — 1,... ,n. 
We note that the boxes 03o and 03 are of the same dimensions but 03o 
is positioned at the origin, while 03 can be at an arbitrary location in 
M n . 

Namely, for a vector a vector a = (a 0 , cq,..., a n ) G Z n+1 we use 
N n ( a; 03 o , 03) to denote the number of solutions to the equation 

n 

(7) Yaj-L - q 0 , (rr,... ,r n ) G 03 o , (si,... ,s n ) G 03. 

i =i Tj 

We remark that we have droped the condition of co-primality of the 
variables, which only increases the number of solutions, but does not 
affect our main results. 

Throughout the paper, any implied constants in the symbols O , <C 
and may depend on the real parameter e > 0 and the integer 
parameter n > 1. We recall that the notations U = 0(V), U -C V 
and 7 > 1/ are all equivalent to the statement that the inequality 
\U\ < cV holds with some constant c > 0. 

Theorem 1. Let 03 and 03 0 be two boxes of the form (5) and (6), respec¬ 
tively, with max J=1 .. n Hj = H. For any vector a = (a 0 , oq,..., a n ) G 
Z n+1 with 1 < |a, ; | < exp ( H °( 1 )) ; i — 1,..., n, we have 

N n ( a; 03 o , 03) < H x ... H n (logH) 2n - 1+0 ^. 

We remark that in Theorem 1 there is no restriction on the size of 
non-vanishing of ao on the right hand size of (7). 
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Corollary 2. We have 

H n2 < S n (H) < H n2 (logH) n ^~ 1)+oW . 

We derive these results via modular reduction of the equation (7) 
modulo an appropriately chosen prime p. In turn, to estimate the num¬ 
ber of solutions of the corresponding congruence, we use a new bound 
of double exponential sums, which slightly improves [9, Lemma 3]. We 
present this result in a larger generality than is need for proving The¬ 
orem 1 as we believe it may be of independent interest. 

Furthermore, since there is a gap between upper and lower bounds of 
Corollary 2 it is natural to do some numerical experiments and try to 
understand the asymptotic behaviour of S n (H) as H —* oo. Thus, it 
is interesting to design an efficient algorithm to compute S n (H ). Since 
by (2), it is enough to compute L n (H) we see that one can do this via 
the following naive algorithm: for each choice of n — 1 positive integers 
r, < H, i = 1,... ,n — 1, then each choice of n — 1 nonnegative integers 
Si < Vi, with gcd(sj, — 1 , i — 1 ,..., n — 1 , check whether 

1 - si/n - ... - s n -i/r n _i e T(H). 

This obviously gives an algorithm of complexity // 2n ~ 2 +°( 1 ). We now 
show that one can compute S n (H) faster. 

Theorem 3. There is a deterministic algorithm to compute S n (H) of 
complexity // 3n / 2 ~ 1 +°( 1 ). 


2. Preliminaries 

2.1. Background on totients. Let <p(r) denote the Euler function. 

We need the following well-known consequence of the sieve of Er¬ 
atosthenes. 

Lemma 4. For any integers r > £ > 1, 

■£ l = hffi + 0( r "M). 


Proof. For an integer d > 1 we use p(d) to denote the Mobius function. 
We recall that /x(l) = 1, p(d) = 0 if d > 2 is not square-free, and 
p(d) = (—l) 6u ’( d ) otherwise, where ui(d) is the number of prime divisors 
of d. 
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Now, using the Mobius function fi{d ) over the divisors of r to detect 
the co-primality condition and interchanging the order of summation, 
we obtain 


l 


£ 

S = 1 

gcd(s,r)=l 


1 = 

d\r 



‘j2^-+o 


d\r 


d 



We now use the well-known identity 

M = ip{r) 
d r ’ 

d\r 

see [4, Section 16.3] and also that and that 

£|M<*) | = 2“M 

d\r 

see [4, Theorem 264], which yield 

1 = ^-£ + 0 ( 2 ^). 

S =1 

gcd(s,r)=l 

Since obviously co(r)\ < r, the result now follows immediately. □ 



One can certainly obtain a much more precise version of the following 
statement, which we present in a rather crude form that is, however, 
sufficient for our applications. 

Lemma 5. For any n > 1 we have 

H 

£ Ur )”' 1 » H ". 

r =1 

Proof. By the Holder inequality, 

/ h \ n ~ l H 

£wo <^“- 2 £wr‘. 

\ r=l ) r =1 

Using the classical asymptotic formula 

E^ r ) = (^ + o(1) ) R2 ' 

r =1 ' ' 

see [4, Theorem 330], we conclude the proof. □ 
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2.2. Background on divisors. For an integer m > 1, we use r(m) to 
denote the divisor function 

r(m) = #{d 6Z : d > 1, d \ m} 

and A(m) to denote the Hooley function 

A(m) = max #{d G Z : u < d < eu, d \ m}. 

u> 0 

We need upper bounds on the average values of r(m) and A(m), 
where m runs through terms of arithmetic progressions indexed by 
prime numbers. We derive these bounds from very general results of 
Nair and Tenenbaum [8]. 

We start with the r(m). 

Lemma 6. For any fixed real e > 0 and integer n > 1, for positive 
integers a, m and M > max{a,m E }, we have 

r(a +pm)" < M(logM) 2 " -2 . 

M<p<2M 
p prime 


Proof. We apply [8, Theorem 3] (taken with the polynomials Q(X) = 
mX + a and x = y = M), and derive 


Y, r(a + pm) n <C 

M<p<2M 
p prime 


M 

log M 


n 

p<M 
p prime 



E 

m<M 


r(m) n 

m 


where the implied constant depends only on e and n. Using the 
Mertens formula for the product over primes, and also the classical 
bound of Mardjanichvili [6] 


(8) Y, r ( m ) n < M(logM) 2n_1 

m<M 


(combined with partial summation), we easily derive the result. □ 


For A (m) we obtain a slightly stronger bound. 

Lemma 7. For any fixed real £ > 0 and integer n > 1, for positive 
integers a, m and M > max{a,m E }, we have 

Y A (« + P m T < M(log M) 2 ”- n - 2+o(1) . 

M<p<2M 
p prime 
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Proof. As in the proof of Lemma 6, by [8, Theorem 3] we have 


A (a + pm) n <C 

M<p<2M 
p prime 


M 

log M 


n 

p<M 
p prime 



A [m) n 
m 


where the implied constant depends only on £ and n. Now, using [10, 
Lemma 2.2] instead of (8) we conclude the proof. □ 


We remark that using the full power of [8, Theorem 3] and [10, 
Lemma 2.2], one can replace o(l) in the power of log M in Lemma 7 
by a more precise and explicit function of M. 


2.3. Product and least common multiples of several integers. We 

need the following result of Karatsuba [5] 

For an integer n > 1 and real Ri ,..., R n , let J n (R \,..., R n ) denote 
the number of solutions to congruence 

(9) rr ... r n = 0 (mod lcm [r\ ... r 2 ]) 

in positive integer Tj < Rj , j — 1,... ,n. 

Lemma 8. For any real R\,, R n > 2, we have 

J n (R u ...,R n )^R 1/2 (\ogR) n2 , 

where R = R\... R n . 


2.4. Exponential sums with ratios. For a prime p, we denote e p (z) = 
exp(27r iz/p). Clearly for any p > u,v > 1 the expression e p (av/u ) is 
correctly defined (as e p (aw ) for w = v/u (mod p)). 

The following result is a variation of [9, Lemma 3], where the addi¬ 
tional averaging over primes allows us to replace with a power of 

logQ. 


Lemma 9. Let Q > U, V > 1 be arbitrary integers and let C C [0, U] x 
[0, V] be an arbitrary convex domain. Then, uniformly over the integers 
a G [1, 2 Q\, we have 


n 


Y e p ( av / u ) 

Q<p<2Q ( u,v)eC 
p prime 
gcd(a,p)=l 


< (U + V) n Q(logQ ) 2n - 2+ °^. 
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Proof. Since C is convex, for each v we there are integers V > W u > 
V u > 0 such that 

u w u 

Y e p (av/u ) = Y e pi av / u )- 

( u,v)£C u= 1 u=V [1 +l 


Following the proof of [9, Lemma 3], we define 

I = \ \og{2Q/V)\ and J = [log(2Q)J . 


Furthermore, for a rational number a = v/u with gcd (u,p) = 1, 
we denote by p p (a) the unique integer w with w = v/u (mod p) and 
|tu| <p/2. Then [9, Equation (1)] implies 

j 

Y e p(av / u) <C VR p + Q Y, Tj tP e ~ j , 

(u,v)eC j=i+1 


where 

R P = #{u : 1 < u < U, \p p (a/u)\ < e 7 } , 

Tj, P = # {u : 1 < u < U, e- 7 < \p p {a/u)\ < e 7-1 " 1 } . 


Thus, using the Holder inequality twice, we obtain: 

j 


Y e p( au / 

( u,v)£C 


u 


« v-r; + Q n [Y. T U 




\j = I +1 


< v n R n p + g n (io g g ) n - 1 Y t k 

3=1 +1 


™ p ~3™ 


Hence 


( 10 ) 


where 


n 


Y e p( av / u ) 

Q<p<2Q ( u,v)GC 
p prime 
gcd(a,p)=l 


J 

^V n y\pQ n (\ogQ) n ~ 1 Y ^3 e ~ jn i 

3=1 +1 


91= £ J?" 

Q<p<2Q 
p prime 
gcd(o,p)=l 


and T,= V 7™. 

Q<p<2Q 
p prime 
gcd(o,p)=l 


As in [9], we note that if e- 7 < \p p (a/u)\ < e J+1 , then uz = a 
(mod p) for some integer z with e- 7 < \z\ < e- 7 " 1 " 1 . Thus uz = a+pk for 
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some integer k with \k\ < Kj where Kj = \e^ +1 U/Q\ + 1. Therefore, 
recalling the definitions of the r(m) and A(m), we conclude 

R p < r(\a + pk\) and T^ p < A(|a + pfc|). 

Now using the Holder inequality, changing the order of summation and 
applying Lemma 6 (which applies as Q Kj > |fc|) we derive 

D\< ( J2 T (\ a + P k \) 

Q<p<2Q \ |fc|<Kj 
p prime 

<Kf-' E T(|a + p*:|)"«/i- f “Q(logQ) 2 "- 2 . 

\k\<Kj Q<p<2Q 
p prime 

Similarly, applying Lemma 7, we see that 

A (\ a +P k \) 

Q<p<2Q V \k\<Kj 
p prime 

< A”'- 1 ^ A (|a + p/c|) n -C iL”<5(log <5) 2n_ri_2+0 ^ 1 \ 

|fc|</y q<p<2Q 
p prime 




for j = I + 1,..., J. Substituting this bound in (10), yields 


n 


(ii) 


e p ( av / u ) 

Q<P<2Q (u,v)GC 
p prime 
gcd(a,p)=l 


J 

<ci/ n AT7g(iogg) 2ri - 2 + g n+1 (iogg) 2ri - 3+o(1) ^ e~ jn iq. 

3 = 1 +1 


We now have 

K? <C e In U n Q~ n + 1 <C U n V~ n + 1 

and also 

j j 

e~ jn KJ < ( e jn U n Q~ n + l) e~ jn 

j=i +i j=i+i 

< J7/ n /g n + e“ 7n < U n Q~ n log Q + H n g- n . 

Combining the above bounds with (11), after simple calculations, we 
obtain the desired result. □ 
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3. Proofs of Main Results 

3.1. Proof of Theorem 1. We note that since |ai... a n \ = exp (if°b)), 
this product has at most H° b) prime divisors. Hence, there is a con¬ 
stant C > 0 such that for Q = \H C ~\ there is a set V of at least 

(12) #R>0.5Q/logQ 

primes p G [Q,2Q\ that are relatively prime with a\...a n . We also 
assume that C > n. In particular, 

(13) » Q > H n . 


Let M n (a;p, QSoj 53) be the number of solutions to the congruence 

n 

^aj— =a 0 (mod p), (rr, ..., r n ) e 53 0 , (si,..., s n ) e 53. 
j =i r i 


Using the orthogonality of exponential functions, we write 

1 P_1 ( ( n 

M n (a;p, 53 0 ,53) = ~ e P ( A \Y 1 ~ a ° 

(r-i,...,r n )e < Bo P A=0 V \j=l j 

Changing the order of summation, gives the identity 

^ p —1 n Hj Bj+Hj 

M n (a;p,53 0 ,53) = e p (-Aa 0 ) e p ( Aa i s i/ r l) • 

P A=0 j=l rj=1 sj=Bj +1 



Now, the contribution from A = 0 gives the main term (Hi . .. H n ) 2 /p. 
Extending the summation over A to all positive integers A < 2 Q with 
gcd(A,p) = 1, for every p e V we obtain 


M n { a; p, 5S 0 , 53) 


< 


H n f 

Q 



2 Q n 

e n 

A=1 j= 1 
gcd(A,p)=l 


Hj Bj+Hj 

yi e p(^ a j s j/ r j) 

v j —— 1 Sj — B j 1 


Hence, summing over all p E V and denoting 


W( A) 


2Q n 

e n 

p£V j =1 

gcd(A,p)=l 


Hj Bj +Hj 

5^ X] W (AUj-Sj/Tj) 

T 'j — 1 S j — B j 1 
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we see that 


2 Q 


(14) + £ W( A). 


p&V 


A=1 
gcd(A,p)=l 


Using the Holder inequality, we obtain 


n 2 Q 


W(xr < n E 


3 = 1 PG^ 

gcd(A,p)=l 


^ ^ e P (XajSj/r 


rj =l Sj=Bj +1 


We now invoke Lemma 9 and see that 

W(A) « H x .. .H n Q(logQ) 2n - 2+ °^, 
which together with (14) implies 

]T M n ( a; p, ® 0 , ®) « #T {Hl - Q HnY +H 1 ... H n Q( log Q) 2 "- 2+ °« 


p£V 


Hence, by (12), there is a prime p E V with 

M n (a;p, ©o, 03) « (i/l - g ^ n)2 + U, ... /^(logQ) 2 "-^ 1 ). 

Recalling (13), we see that the second term dominates and logQ can 
be replaced with log H . Therefore, 

M n (a;p, ©o, ®)«tfi... tf„(log #r~ 1+o(1) . 

Using the trivial bound lV n (a; 03 o , 53) < M n (a;p, 53 0 , 53), we conclude 
the proof. 


3.2. Proof of Corollary 2. The upper bound is immediate from Theo¬ 
rem 1 and the equation (2) (we note that for the typographical simplic¬ 
ity the values Sj = 0 are excluded in Theorem 1, but a simple inductive 
argument allows us to include them). 

To see the lower bound we note that by Lemma 4 for any positive 
integer r < H and i = \r/(n — 1)J and we can choose positive integers 
si,..., s n _i < i with 

(15) gcd(si...s n _i,r) = l 

in 

f(l + o(l))^fj X^r )"- 1 

possible ways. After this we set s n = r — si — ... — s n __i. Clearly, 
by the co-primality condition (15), the vectors of rational numbers 
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(si/r,... ,s n /r ) obtained via the above construction are pairwise dis¬ 
tinct. Hence 

H 

r =1 

Applying Lemma 5 we obtain L n (H ) iP and the result follows 
from (2). 

3.3. Proof of Theorem 3. Clearing the denominators, we transform (3), 
into the following equation 


£• 

3 = 1 


r\ ■ ■ ■ r r 


= n 


Since for every j — 1,..., n we have gcd(rj, Sj) = 1, we see the divisi¬ 
bility 

, P ...r n 


or rj | ri...r n , which implies the congruence (9) for every solution 
to (3). 

Using standard arithmetic algorithms for computing the greatest 
common divisor and this the least common multiple, we see that we 
can enumerate all vectors of positive integers (r\,. .., r n ) for which (3) 
has a solution for some s\,... ,s n in time O (LP+U 1 )). By Lemma 8, 
the resulting list contains 0(// n / 2 +°( 1 )) vectors. 

Now for each (ri,..., r n ) we choose 0 < Sj < Tj, j — 1,..., n — 1 in 
O (LP -1 ) ways, define s n by the equation (3) and check whether other 
conditions in (3). This leads to the desired algorithm. 


4. Comments 

We remark that estimating L n (H) by the number of solutions to 
an equations of the type (7) (that is, without the co-primality condi¬ 
tion) can lead to additional logarithmic losses. This effect has been 
mentioned in [2] and can also be easily seen for n — 2. 

We recall that a square n x n matrix A is called doubly stochastic 
if both A and the transposed matrix A T are stochastic. 

We now define S n (H) as the number of doubly stochastic n x n 
matrices with rational entries from J-(H). We have the following trivial 
bounds 

>(log p)( n - 1 )( 2 ™- 1 )+°( 1 ). 


(16) 


jjn 2 ‘-2n-\-2 


« S n (H) < H r 
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Indeed the upper bound in (16) follows immediately from Theorem 1 
and the observation that if the top n — 1 rows of a doubly stochastic 
matrix are fixed then the last row is uniquely defined. To get a lower 
bound on S n (H ), we fix a positive integer r and for each i = 1,, n— 1 
we choose the elements a h j = Sij/r, i = 1, ... ,n — 1, j — 1,... ,n, of 
the first n — 1 rows as in the proof of Corollary 2 (with respect to the 
same r). After this we also define 

n—1 

OL n ,j 1 ''y ^ CKjJ) j 1, . . . , 72. 

i =1 


It only remains to note that 

n n n—1 


n—1 n 


£ 

3 = 1 


^ n,j ^ 


a i,j 71 1' 


j=l i=l 


i=l 3 =1 


Now, simple counting yields the lower bound in (16). 
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